Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Amulti-evidence, multi-engine OCR system

Identifieur interne : 000F44 ( Main/Exploration ); précédent : 000F43; suivant : 000F45

Amulti-evidence, multi-engine OCR system

Auteurs : Ilya Zavorin [États-Unis] ; Eugene Borovikov [États-Unis] ; Anna Borovikov [États-Unis] ; Luis Hernandez [États-Unis] ; Kristen Summers [États-Unis] ; Mark Turner [États-Unis]

Source :

RBID : Pascal:08-0459039

Descripteurs français

English descriptors

Abstract

Although modern OCR technology is capable of handling a wide variety of document images, there is no single OCR engine that performs equally well on all documents for a given single language script. Naturally, each OCR engine has its strengths and weaknesses, and therefore different engines tend to differ in the accuracy on different documents, and in the errors on the same document image. While the idea. of using multiple OCR engines to boost output accuracy is not new, most of the existing systems do not go beyond variations on majority voting. While this approach may work well in many cases, it has limitations, especially when OCR technology used to process a given script has not yet fully matured. Our goal is to develop a system called MEMOE (for "Multi-Evidence Multi-OCR-Engine") that combines, in an optimal or near-optimal way, output streams of one or more OCR engines together with various types of evidence extracted from these streams as well as from original document images, to produce output of higher quality than that of the individual OCR engines, or of majority voting applied to multiple OCR output streams. Furthermore, we aim to improve the accuracy of OCR output on images that might otherwise have low accuracy that significantly impacts downstream processing. The MEMOE system functions as an OCR engine taking document images and some configuration parameters as input and producing a single output text stream. In this paper, we describe the design of the system, various evidence types and how they are incorporated into MEMOE in the form of filters. Results of initial tests that involve two corpora of Arabic documents show that, even in its initial configuration, the system is superior to a voting algorithm and that even more improvement may be achieved by incorporating additional evidence types into the system.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Amulti-evidence, multi-engine OCR system</title>
<author>
<name sortKey="Zavorin, Ilya" sort="Zavorin, Ilya" uniqKey="Zavorin I" first="Ilya" last="Zavorin">Ilya Zavorin</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borovikov, Eugene" sort="Borovikov, Eugene" uniqKey="Borovikov E" first="Eugene" last="Borovikov">Eugene Borovikov</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borovikov, Anna" sort="Borovikov, Anna" uniqKey="Borovikov A" first="Anna" last="Borovikov">Anna Borovikov</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Hernandez, Luis" sort="Hernandez, Luis" uniqKey="Hernandez L" first="Luis" last="Hernandez">Luis Hernandez</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Army Research Laboratory</s1>
<s2>Adelphi, MD</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Summers, Kristen" sort="Summers, Kristen" uniqKey="Summers K" first="Kristen" last="Summers">Kristen Summers</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Turner, Mark" sort="Turner, Mark" uniqKey="Turner M" first="Mark" last="Turner">Mark Turner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0459039</idno>
<date when="2007">2007</date>
<idno type="stanalyst">PASCAL 08-0459039 INIST</idno>
<idno type="RBID">Pascal:08-0459039</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000257</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000527</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000287</idno>
<idno type="wicri:Area/Main/Merge">000F58</idno>
<idno type="wicri:Area/Main/Curation">000F44</idno>
<idno type="wicri:Area/Main/Exploration">000F44</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Amulti-evidence, multi-engine OCR system</title>
<author>
<name sortKey="Zavorin, Ilya" sort="Zavorin, Ilya" uniqKey="Zavorin I" first="Ilya" last="Zavorin">Ilya Zavorin</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borovikov, Eugene" sort="Borovikov, Eugene" uniqKey="Borovikov E" first="Eugene" last="Borovikov">Eugene Borovikov</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borovikov, Anna" sort="Borovikov, Anna" uniqKey="Borovikov A" first="Anna" last="Borovikov">Anna Borovikov</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Hernandez, Luis" sort="Hernandez, Luis" uniqKey="Hernandez L" first="Luis" last="Hernandez">Luis Hernandez</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Army Research Laboratory</s1>
<s2>Adelphi, MD</s2>
<s3>USA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Summers, Kristen" sort="Summers, Kristen" uniqKey="Summers K" first="Kristen" last="Summers">Kristen Summers</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Turner, Mark" sort="Turner, Mark" uniqKey="Turner M" first="Mark" last="Turner">Mark Turner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>CACI International Inc, 4831 Walden Lane</s1>
<s2>Lanham, MD 20706</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings of Electronic Imaging Science and Technology</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings of Electronic Imaging Science and Technology</title>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Accuracy</term>
<term>Algorithms</term>
<term>Arabic</term>
<term>Document image processing</term>
<term>Downlink</term>
<term>Image processing</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Voting</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Algorithme</term>
<term>Reconnaissance optique caractère</term>
<term>Traitement image document</term>
<term>Précision</term>
<term>Vote</term>
<term>Canal descendant</term>
<term>Arabe</term>
<term>Reconnaissance forme</term>
<term>Traitement image</term>
<term>0130C</term>
<term>4230S</term>
<term>4230V</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Vote</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Although modern OCR technology is capable of handling a wide variety of document images, there is no single OCR engine that performs equally well on all documents for a given single language script. Naturally, each OCR engine has its strengths and weaknesses, and therefore different engines tend to differ in the accuracy on different documents, and in the errors on the same document image. While the idea. of using multiple OCR engines to boost output accuracy is not new, most of the existing systems do not go beyond variations on majority voting. While this approach may work well in many cases, it has limitations, especially when OCR technology used to process a given script has not yet fully matured. Our goal is to develop a system called MEMOE (for "Multi-Evidence Multi-OCR-Engine") that combines, in an optimal or near-optimal way, output streams of one or more OCR engines together with various types of evidence extracted from these streams as well as from original document images, to produce output of higher quality than that of the individual OCR engines, or of majority voting applied to multiple OCR output streams. Furthermore, we aim to improve the accuracy of OCR output on images that might otherwise have low accuracy that significantly impacts downstream processing. The MEMOE system functions as an OCR engine taking document images and some configuration parameters as input and producing a single output text stream. In this paper, we describe the design of the system, various evidence types and how they are incorporated into MEMOE in the form of filters. Results of initial tests that involve two corpora of Arabic documents show that, even in its initial configuration, the system is superior to a voting algorithm and that even more improvement may be achieved by incorporating additional evidence types into the system.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Maryland</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Maryland">
<name sortKey="Zavorin, Ilya" sort="Zavorin, Ilya" uniqKey="Zavorin I" first="Ilya" last="Zavorin">Ilya Zavorin</name>
</region>
<name sortKey="Borovikov, Anna" sort="Borovikov, Anna" uniqKey="Borovikov A" first="Anna" last="Borovikov">Anna Borovikov</name>
<name sortKey="Borovikov, Eugene" sort="Borovikov, Eugene" uniqKey="Borovikov E" first="Eugene" last="Borovikov">Eugene Borovikov</name>
<name sortKey="Hernandez, Luis" sort="Hernandez, Luis" uniqKey="Hernandez L" first="Luis" last="Hernandez">Luis Hernandez</name>
<name sortKey="Summers, Kristen" sort="Summers, Kristen" uniqKey="Summers K" first="Kristen" last="Summers">Kristen Summers</name>
<name sortKey="Turner, Mark" sort="Turner, Mark" uniqKey="Turner M" first="Mark" last="Turner">Mark Turner</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000F44 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000F44 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:08-0459039
   |texte=   Amulti-evidence, multi-engine OCR system
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024